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Abstract. We study stable W-length in groups, especially for W equal to 
the n-fold commutator 7„ := [x\, [x2, ■ ■ ■ [x n _i, x n ]] •• •]• We prove that in 
any perfect group, for any n > 2 and any element g, the stable commutator 
length of g is at least as big as 2 2— ™ times the stable 7 n -length of g. We also 
establish analogues of Bavard duality for words 7™ and for f$2 '■= [[£>?/], 
Our proofs make use of geometric properties of the asymptotic cones of verbal 
subgroups with respect to bi-invariant metrics. In particular, we show that for 
suitable W, these asymptotic cones contain certain subgroups that are normed 
vector spaces. 



1. Introduction 

Geometric group theory aims to produce functors from the algebraic category 
of groups and homomorphisms to geometric categories of spaces and structure- 
preserving maps. The category of metric spaces and isometries does not have 
enough morphisms for many applications, so one instead typically studies functors 
from the category of groups and homomorphisms, to the category of metric spaces 
and 1-Lipschitz (i.e. distance decreasing) maps. 

A rich source of such functors arise in the theory of bounded cohomology, in- 
troduced systematically by Gromov in |12) . In that theory, the metric spaces are 
usually normed vector spaces, and the morphisms bounded linear operators. An- 
other rich source of such examples comes from the study of (conjugation-invariant) 
norms; see e.g. [3] for a discussion, and Q3] for an application to the theory of 
mapping class groups. 

Such functors are often useful for the study of groups as dynamical objects, 
where the functor "geometrizes" the group action, and allows one to obtain a priori 
control of dynamical quantities from algebra. For example, stable commutator 
length (hereafter scl; see [6] or [1] for an introduction) has rich connections to 2- 
dimensional dynamics, symplectic geometry, hyperbolic geometry, and so on. 

A natural class of characteristic norms (those invariant under any automor- 
phism), with good monotonicity properties, arise from the theory of words. Given 
a subset W of a free group F, a W -word in G is the image of some w S W under 
some homomorphism <j> : F — > G. The W^-words in G generate a so-called verbal 
subgroup Gw (see e.g. Q2]), an d the W -length of any g £ Gw is defined to be the 
smallest number of W-woids and their inverses in G whose product is equal to g. 
For example, many authors study square length, which is W^-length in the case W 
consists of the single word x 2 . W- length (usually for W consisting of a single word 
w) has been intensively studied in finite groups, and recently some very strong the- 
orems have been obtained by Shalev and his collaborators (e.g. [301 US HI])- See 
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[19] for a survey, and an introduction to some of these methods. However, with 
some exceptions (notably [18]), W- length has not been widely studied in (general) 
infinite groups except in the special case of W = {[x, y]} — i.e. commutator length. 
Part of the problem is that VF-length seems to be such an unstructured quantity in 
general, and is exceedingly hard to compute, or even to estimate — even in finite 
groups! Therefore in this paper we propose to study a suitable "rationalization" 
of this quantity, namely stable W -length, where the stable WMength of an element 
g G Gw is defined to be the limit of the quantity (TV-length of g n divided by n) as 
n — > oo. Our aim is to generalize (to the extent that it is possible) some of what is 
known about stable commutator length to more general classes of norms. 

Perhaps the most significant difficulty in generalizing the theory of (stable) com- 
mutator length to more general stable lengths is that of linearizing the theory. The 
commutator calculus exhibits an intimate connection between the algebraic theory 
of commutators and the linear theory of (2-dimensional) homology. Commutators 
arise as boundaries in group homology, and one may obtain a duality (known as 
Bavard Duality; see [T] and [5], Chapter 2) between stable commutator length and 
certain natural group 1-cocycles called homogeneous quasimorphisms. There is no 
natural homology theory available for stable VK-length for more general W, but 
one may obtain a generalization of the theory for commutators by a geometric con- 
struction. Given a group G, one considers the Cayley graph of [G,G] taking as 
generators all commutators in G. This is a metric space (a graph), and the group 
[G, G] acts on itself by metric isometries. Because every commutator has "length 
1" in this graph, the action of the group [G, G] on itself is "almost" commutative. 
When one replaces the Cayley graph by its asymptotic cone (an infinite re-scaled 
version), the limit becomes exactly commutative, and one obtains a normed vector 
space, which can be identified with one of the natural spaces obtained from the 
homological approach. It is this geometric construction that generalizes: given any 
W, one considers the Cayley graph of Gw with all W- words and their inverses 
as generators. Since TV- words act with bounded length in the Cayley graph, the 
rescaled asymptotic cone obeys the "law" that all W-words are trivial. With more 
work, one can obtain a certain subset of the asymptotic cone where, for suitable W, 
the resulting group is actually abelian, and is in fact a normed vector space where 
one can establish the analogue of Bavard duality. This program is most successful 
when W is an n-fold commutator j n , in which case we are able to establish (for 
perfect groups G) a precise analog of Bavard duality, and to prove the existence 
of two-sided estimates of stable 7„-length in terms of stable commutator length. 
This comparison theorem is a genuinely stable phenomenon, and does not hold for 
"naive" 7„-length (for a precise statement of results, see § 11. ip . 

1.1. Statement of results. We now give a precise statement of the main results 
in the paper. 

In § [2] we introduce terminology, and establish basic properties. We show that 
there are various inequalities relating WMength and stable WMength in various 
groups and for various different W . We use the notation (here and elsewhere) of 
1(*|W) for W-length, and sl(*|W) for stable W-length, with cl(*) and scl(*) for the 
special case of commutator length and stable commutator length respectively. 

We give simple examples of groups where stable w-length is nontrivial for essen- 
tially all words (free groups, hyperbolic groups) and where stable w-length is trivial 
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for all w (SL(2,A) for certain rings of algebraic integers A). We also derive some 
nontrivial estimates (usually upper bounds) on si in free groups. For example, 

(Cor. [2.17p sl(u>|u>) inequality. For any w there is an inequality 

. scl(ui) , . . . 

1/2 < 77 r -, , n < si w w < 1 
' ~ scl(tu) + 1/2 - v 1 ' ~ 

We strengthen the upper bound by an explicit construction, for certain natural 
classes of words. Let j n denote the n-fold commutator, so 71 = x, 72 = [x,y], 
73 = [x, [y, z]] and so on. We obtain the estimate 

(Prop. I2.28p 7„ inequality. For any n there is an inequality 

Sl(7«|7n) < 1-2 1 -" 

§[3]is the heart of the paper. To any group G and any W, we study the asymptotic 
geometry of the Cayley graph Cw of Gw with all VF-words as generators. A limit 
of rescalings of Cw gives rise to the asymptotic cone Aw ■ The cone Aw is itself a 
group, with a bi-invariant metric, obeying a nontrivial law. It contains a canonical 
contractible subgroup Aw which metrically encodes the values of stable VK-length 
on Gw- Our results are most definitive for certain specific W; in particular, 

(Thm. 13. lip . Suppose Aw is nilpotent (for example, if W = 7„ for some n). 
Then it is a normed vector space (in particular it is abelian). 

Moreover, if /?2 denotes the word [[x, y], [2, if]], 

(Thm. [3TT4]) . Suppose in some group G that stable commutator length vanishes 
identically. Then Aw is abelian (and hence a normed vector space) for W — fii. 

The fact that these asymptotic cones are normed vector spaces lets one study 
their geometry (and thereby stable W-length) dually, via 1-Lipschitz homomor- 
phisms Aw — > M. Motivated by Bavard duality, we say a function <fi '■ G — > K is 
a weak 7„-hoq ("homogeneous quasimorphism" ) if it is homogeneous (i.e. 4>{g n ) — 
n(j){g)) and if there is a least non-negative real number D(<j>) (called the defect) such 
that for any g,h G G there is an inequality 

\<t>{g)+<j>{h)-cl>{gh)\ <£>(0)nii]ia(5|7«-i),l(/i|7«-i),l^l7«-i)) 

(Thm. [3723]) 7„-Duality theorem. For any perfect group G, and any g € G 

there is an inequality 

sup 0(g) /D(4>) > sl(<?|7„) > su P 0( 5 )/2^(0) 

where the supremum is taken over all weak "f n -hoqs. 

From this we derive the following rather surprising estimate, comparing stable 
commutator length with stable 7„ length in any perfect group: 

(Thm. I3.25P 7„-Comparison theorem. For any perfect group G, for any n > 2 

and for any g S G there is an inequality 

2"- 2 scl( 5 ) > sl( 5 | 7 „) > sclQ?) 

Together the 7„-Comparison theorem and the 7„-Duality theorem show that 
stable commutator length in a perfect group G can be bounded from below by 
weak 7„-hoqs. It would be interesting to try to find naturally occurring examples 



4 



DANNY CALEGARI AND DONGPING ZHUANG 



of such functions arising from realizations of G as a group of automorphisms of 
some geometric object. 

In a similar vein, we say <p : G — > K is a weak /32-hoq if it is homogeneous, and 
if there is a least non- negative real number D((f>) (called the defect) such that for 
any g,h £ G there is an inequality 

+ 4>(h) - <f>(gh)\ < D(4>) mm(i(<7|/3 2 ), \(h\fo), l(gh\(3 2 )) 

(Thm. [3.3ip /^-Duality theorem. For any perfect group G in which scl vanishes 
identically, and for any g £ G there is an inequality 

su P 2<Hs)AD(0) > slfolft) > su P 0(.g)/2^(0) 
where the supremum is taken over all weak ^-hoqs. 

Note that stable commutator length is known to vanish identically in many impor- 
tant classes of groups, including 

• amenable groups 

• lattices in higher rank Lie groups [4] 

• groups of PL homeomorphisms of the interval [5] 

• groups that satisfy a law [7] 

It is not known whether stable VF-length for various W vanish in these classes of 
groups. One of the aims of this paper is to develop tools to approach this question. 

Finally, in § 21 we give an elementary construction, using hyperbolic geometry, 
of a class of elements w n in free groups of various ranks for which scl(w;„) = 1/2 
and sl(w„|73) = 1, but for which c\{w n ) = 1 and \{w n \^) > 2n/3. 

1.2. Acknowledgments. We would like to thank Frank Calegari, Benson Farb, 
Denis Osin and Dan Segal for helpful comments. We would also like to thank the 
anonymous referee for a very careful reading and for catching several errors. Danny 
Calegari was supported by NSF grant DMS 1005246. 

2. Basic properties 

The purpose of this section is to standardize definitions and notation, and to 
survey some elementary constructions in the theory. 

Notation 2.1. We use the notation [x, y] for xyx~ 1 y~ 1 and x y for yxy" 1 . With 
this convention, exponentiation obeys (x v ) z = x zy . We also use the notation x* for 
some (unspecified) conjugate of x. 

The following identities, though elementary, are useful. 

Lemma 2.2. The following identities hold (the letters denote free generators): 

(1) [x, y] = [y x : x' 1 ] 

(2) [x^y]- 1 = [y,x] = [x^jT 1 ] 

(3) [x,yz] = [x,y][x,z] v = [x,y][x,z][[z,x],y] 

(4) [xy,z] = [x,z][y,z} xZ = [x, z }[y, z] [[z, y], x z ] 

(5) [[y,x},z x ][[x 7 z},y z }[[z,y},x y } = 1 

Bullet (5) is known as the Hall- Witt identity. 
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2.1. Verbal subgroups. 

Definition 2.3 (W-woid). Let F be a free group, and W a subset of F. A W -word 
in a group G is the image of some w € W under a homomorphism F — > G. 

Definition 2.4 (verbal subgroup). Let G be a group. A verbal subgroup of G is the 
subgroup generated by the 14^-words, for some W C F. The W-verbal subgroup of 
G is denoted Gw- 

The group G is said to obey the law W if Gw is trivial. A group is said to obey 
a law if Gw is trivial for some nonempty W. 

For any sets V, W C F with Fy — Fw (for example, if W C V C i*V) and for 
any group G, the verbal subgroups Gy and Gw are equal, though the V-words and 
the VF-words in G might not be equal as sets. 

Since the image of a W-word under any automorphism is also a VF-word, verbal 
subgroups are characteristic (i.e. invariant under every automorphism). In particu- 
lar, a conjugate of a VF-word is a W-word. In a free group, it is tautologically true 
that characteristic subgroups are verbal. 

In the sequel we arc interested in verbal subgroups Gw where W is a single 
element of F. For the sake of legibility, we use a lower case w to denote a single 
element of F, and likewise denote the w-subgroup of G by G w . In fact, we are 
interested in certain explicit words w. The main w of interest in this paper are the 
n-fold commutators. 

Definition 2.5. Here and in the sequel, we let 7„ denote the n-fold commutator 
of the generators of F n . That is, 

li = x, j 2 = [x,y], j 3 = [x,[y,z\], 74 = [x, [y, [z, w]]] 

and so on. 

Definition 2.6. A word w G F is reflexive if w _1 is a w-word in F. 

Lemma 2.7. The words 7„ are all reflexive. Moreover, any nested bracket of n 
elements is a 7„ word. 

Proof. The second statement means, for example, that an expression like [[x, y], z] is 
a 73 word, that [i, [[[to, [it, v]], z],y]] is a 75 word, and so on. This and the reflexivity 
of 7„ follow inductively from Lemma \2?2\ bullet (2). □ 

For any n and any G, the subgroups G 7n are usually denoted G„ and called 
the lower central series. The successive quotients G/G„ are (universal) (n — 1)- 
step nilpotent quotients of G (so that for instance G/G2 is abelian, G/G3 is 2-step 
nilpotent, and so on). 

Definition 2.8. Here and in the sequel, we define /3i = [x, y], and for n > 1 we 
let /3 n denote the commutator of two copies of /3„_i (in different generating sets). 
Hence 

Pi = [x,y], p2 = [[x,y],[z,w\], /3 3 = {[[x,y],[z,w]],[[s,t],{u,v]]] 
and so on. 

Lemma 2.9. The words f3 n are reflexive. 



Proof. This follows inductively from Lemma \2?2\ bullet (2). 



□ 
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The subgroups Gp n arc usually denoted G^ n > and called the derived series. The 
successive quotients G/G^ are (universal) n-step solvable quotients of G. Note 
that fiy = J2- 

2.2. WMength and to-length. 

Definition 2.10 (W-length). Let W C F be given. Let G be a group, and let Gw 
be the subgroup generated by VT-words. If g E Gw, the W -length of g, denoted 
l(ff|W), is the minimum number of W- words and their inverses in G whose product 
is g, and the stable W -length of g, denoted sl^W), is the limit 

sl(g\W) = hm ™ 

If W consists of a single element, we usually denote it by a lower case to, and 
define w-length and stable w-length. We are typically concerned with this case in 
the sequel. 

If w is reflexive (which shall usually be the case in the sequel) , the inverse of a 
ui-word is a w-word. By abuse of notation we will sometimes refer to both w- words 
and their inverses as w-words. The meaning should be clear from context in each 
case. 

Notation 2.11. If we need to emphasize that g is in specific group G, we use the 
notation l G (g|V^) and s\ G (g\W). 

In the special case that to = [a;, y] in the free group generated by x and y, we 
refer to to-length as commutator length, and stable to-length as stable commutator 
length, and denote them by cl(-) and scl(-) respectively. 

Notation 2.12. We abbreviate the product of n (arbitrary) w-words by (or 
Win} £ or worc l s ). This should not be confused with w n , the nth power of a 
specific word w. 

Lemma 2.13. Suppose w is not in [F,F]. Then sl(-|tu) is identically zero in every 
group. 

Proof. Since w is not in [F, F], there is a surjective homomorphism F — > Z sending 
w to some nonzero n. Without loss of generality, we may assume n > 0. But then 
in any group \(g nm \w) < 1 so s\(g\w) =0. □ 

Lemma 2.14. The functions \(-\W) ands\(-\W) are characteristic, and their values 
do not increase under homomorphisms. 

Proof. The group Aut(G) permutes the canonical generators of Gw, proving the 
first claim. The image of a VF-word under any homomorphism is a W^-word, proving 
the second claim. □ 

By convention, we set sl(«7|W) = s\{g n \W)/n whenever g n e Gw (even if g is 
not necessarily in Gw)- 

2.3. Inequalities. If to and v are contained in free groups F and F' respectively, 
by abuse of notation we think of to and v as being contained in the single free group 
F * F' . By convention therefore we think of all abstract words as being contained 
in a single (infinitely generated) free group, which we denote hereafter by F. 
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Lemma 2.15 (fundamental inequality). For any v,w £ F and any g £ G there is 
an inequality 

1g(s|«) < \g{9\w)\f{w\v) 

Proof. Write g minimally as a product of w-word or their inverses, and then rewrite 
each of these as a minimal product of w-words or their inverses. This gives an 
expression for g as a product of u-words and their inverses. □ 

Note that for this lemma to make sense we must have g £ G w and w £ F v . In the 
case v = [x, y] we obtain c\c(g) < \a(g\w)clF(w). In the case of stable commutator 
length, one obtains a better estimate 

Lemma 2.16 (scl inequality). For any w £ F and any g £ G and n £ Z there is 
an inequality 

scl G (<f) < l G (g n \w)sc\ F (w) + §(1 G (<?» - 1) 

Moreover, 

sd G (g) < sl G (g\w)(scl F (w) + \) 

Proof. Express g n as a product w\ ■ ■ ■ w m of w-words or their inverses, where m = 
lo(g n \w). For any even k we have 

nk I \ k k k 

g ={wi---w m ) =w 1 ---w m v 

where v is a product of at most (m — l)k/2 commutators (see e.g. [6], Lemma 2.24 
for the case m = 2, and apply induction). Taking the limit as k — > oo we obtain 
the first inequality. 

Taking the limit n — > oo and stabilizing gives us the second inequality. □ 

Corollary 2.17 (sl(iu|«;) inequality). For any w £ F there is an inequality 

scl(w) , , . . 

1/2 < 77 , , < si w w < 1 
' ~ scl(w) + 1/2 - v 1 ' ~ 

Proof. The first inequality follows from the fact that scl(ui) > | for any nontrivial 
w in a free group (see e.g. [6], Theorem 4.111). The second inequality follows from 
Lemma 12.161 by setting g = w. The last inequality follows from the tautological 
expression of w n as a product of n copies of w. □ 

A natural question to ask is the following: 

Question 2.18 (strict inequality). For which w £ F is there is a strict inequality 

s\(w\w) < 1 ? 

This question has a positive answer for several interesting classes of words. 

Example 2.19. For each g, let x±,y±, ■ ■ ■ ,x g ,y g be free generators, and let w g — 
[xi,yi] ■ ■ ■ [x g ,y g ]. Since scl(u' g ) = g — | (see e.g. Bavard pQ) we obtain a lower 
bound sl(wg\w g ) > l — l/2g. On the other hand, an expression of u>™ as a product of 
n(g— 1/2) +o(n) commutators is also an expression as a product of n(g—h)/g+o(n) 
w g -words. So 

sl([»i,yi] • • • [x g ,y g ]\[xi,yi] ■ ■ ■ [x g ,y g ]) = 1 - l/2g 

From the definition, sl(g|w) < \{g n \w) /n for any n. On the other hand, this 
inequality can often be definitely improved: 
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Lemma 2.20. For any w G F and any g € G, there is an inequality 

sl G {g\w) < (l(.g"H - 1 + sl(to|«;))/n 
Proof. Suppose g n — w\ ■ ■ ■ w m where m = l(g n \w). For any k we can write 
g nk — w\ - ■ ■ w m w\ ■ ■ ■ w m ■ ■ ■ W\ ■ ■ ■ w m (mk terms) 

= w k w[w' 2 ---w[ m _ 1)k 

where each w\ is a w-word (since xy = y(y~ l xy), and any conjugate of a w-word 
or its inverse is a w-word or its inverse). In particular, if k is large, then w k can 
be expressed as a product of ks\{w\w) + o(k) w-words and therefore g nk can be 
expressed as a product of ks\{w\w) + (to — l)fe + o(k) w-words. Taking k large, we 
obtain the desired inequality. □ 

Corollary 2.21. If\(w n \w) =m then s\(w\w) < (m — l)/(n — 1). 

Proof. Substitute w = g in Lemma [2.201 □ 

We end this section with a couple of examples, illustrating the range of possibil- 
ities one can expect in certain specific classes of groups. 

Example 2.22 (Free group). If F is a free group, scl(<7) is positive for every g € 
[F,F] (in fact, scl(.g) > 1/2). So for every w with F w C [F, F] we get s\ F (g\w) > 
sclp(g) / (sc\f(w) + 1/2) > 0. In other words, for every w for which sl(*|w) is not 
identically zero in every group, it is positive on every element in a free group. 
A similar phenomenon holds in groups for which scl is typically positive, such as 
hyperbolic groups (see e.g. [6], Chapter 3). 

Example 2.23 (SL(2, A)). Carter-Keller-Paige [23] Thm. 6.1. show that if A is the 
ring of integers in a number field K containing infinitely many units, and if T is 
an element of SL(2,A) which is not a scalar matrix (i.e. not of the form A • id) 
then SL(2, A) has a finite index normal subgroup which is boundedly generated by 
conjugates of T. That is, in this finite index normal subgroup, every element can 
be written as a bounded number of conjugates of T . 

For any w, one can find T in SL(2, A) which is a ui-word but not a scalar matrix 
(to see this, observe that SL(2,A) contains SL(2,Z) which contains a free group, 
none of whose nontrivial elements is a scalar matrix). It follows that SL(2, A) has 
a finite index subgroup in which every element has finite w-length. Consequently 
sl(*|u>) vanishes identically in SL(2, A) for every w. 

A similar phenomenon holds in groups such as SL(n,Z) when n > 3. 

2.4. Culler's identity. Culler [10] , in his analysis of commutator length in free 
groups, discovered the beautiful identity 

This identity expresses algebraically the geometric fact that a once-punctured torus 
admits an (irregular) cover of degree 3 with one boundary component. We can get 
a lot of mileage out of this identity; this is not really for a very specific reason, 
rather because this is the simplest identity which certifies that scl([a;, y]) < 1, and 
we are able to "bootstrap" this identity to show sl(w|u;) < 1 for many w. 
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Example 2.24 ([x,y n ]). If we set w — [x,y n ] then we obtain \(w 3 \w) < 2 and 
therefore sl(w|w) < \. Together with Corollary 12 . 1 71 this implies 

S l([x,y n }\[x,y n }) = ± 

Example 2.25 ([x,y] 2 ). If we denote a = [y x ,x v 1 x~ 2 ] and b = [x v 1 ,y 2 ] then 
Culler's identity says 

{[x,y] 2 f = {[x,y] 3 ) 4 = abababab 
= aabbcaabbc for c = [b^ 1 , b~ a~ b~ ] 
= a 2 b 2 c 2 {a') 2 (b') 2 

for suitable commutators a',b'. Hence y] 2 ) 6 1 [x, y] 2 ) < 5 and therefore 

l<sl([x,yf\[x,y] 2 )<l 

2.5. 7„ inequalities. Generalizations of Culler's identity can be obtained from a 
uniform topological argument. 

Lemma 2.26 (generalized Culler identity). For all x, for any n and any y, there 
is an identity of the form 

[x,y] 2n+1 = [*,y n+1 ]* - product of n words of the form [*,y]* 

Proof. Let F = (x,y). For each n, we construct a permutation representation 
p n : F — > S^n+i- We think of an element of San+i & s a bijective function from 
{0, 1, • ■ • , 2n} to itself, and denote such a function by an ordered string of values. 
With this convention, we define 

p n {x) = 2n 2n - 1 2n - 2 • • • 1 

p n (y) = 123---n0n+l---2n 
In words, p n (y) cycles the first n + 1 elements and fixes the rest, while p n (x) is the 
involution which reverses the order. 

If we think of F as the fundamental group of a once-punctured torus S, then 
x and y can be represented by embedded essential loops a and j3 intersecting in 
one point. Since j3 is homologically essential, it is non-separating. Let S' be the 
2n+ 1-fold cover of S whose monodromy representation is p n . By the definition of 
p n , the permutation p n {[x,y\) is a single 2n+ 1-cycle, and therefore dS' consists of 
a single boundary component, mapping to OS with degree 2n+ 1 under the covering 
projection. 

Likewise, /3 has n + 1 preimages; one (call it /3o) maps to /3 with degree 
The others /3$ for 1 < i < n map with degree 1. We claim that there are disjoint, 
embedded curves oti in S' so that each on and /3j intersect transversely in one point, 
and cti does not intersect j3j for i =/= j . Evidently the lemma follows from this. 

To prove this claim we first show that U/3j is non-separating. Let T be a compo- 
nent of S' — UPi . Then T covers 5-/3, and consequently T n OS' is nonempty. But 
this implies T n 9S" = 9S", since 9S" is connected. Since T was arbitrary, every 
component of S' — U/J; contains dS', and therefore S" — U/3, is connected and Uft 
is non-separating, as claimed. 

It follows that S' — Uif3i is connected, and therefore (by reason of %) is homeo- 
morphic to a sphere with 2n + 3 holes. We can therefore find n+1 disjoint properly 
embedded arcs in 5' — 1+/?^, each of which runs between the two boundary com- 
ponents corresponding to a single This proves the claim, and therefore the 
lemma. □ 
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As a corollary, we obtain the following inequality: 

Proposition 2.27 {[x, w] inequality). For any word w, the word [x, w] (where x is 
an extra free generator) satisfies 

\([x,w} 2n+1 \[x,w}) <n + l(w n+1 \w) 

and consequently 

s\([x,w}\[x,w\) < - 

Proof. The proposition follows from Lemma 12.261 together with the observation 
that can be written as a product of \(w n+1 \w) [x, u>]-words, by bullet (3) 

of Lemma O □ 

As an important special case, one deduces: 
Proposition 2.28 ("f n inequality). For any n there is an inequality 

Sl(7nl7n) < 1-2 1 "" 

Proof. For n = 1 this follows from sl(x|x) = 0, a special case of Lemma r2.13l Then 
the result follows from Proposition 12.271 and induction on n. □ 

Question 2.29. Is the estimate in Provosition \2.28\ sharv? 

3. Geometry of Gw 

In this section we begin a more systematic study of stable I'F-length, from a 
geometric point of view. The main tool is the geometry of Cayley graphs, and 
their asymptotic cones. A verbal subgroup of any group admits a tautological, 
characteristic, bi-invariant metric, and it is the asymptotic geometry of this metric 
that gives us insight into stable VF-length. 

3.1. Cayley graph Cw Let G be a group, and Gw its verbal subgroup associated 
to some subset W C F. If w is a single element, we assume it is reflexive. Otherwise, 
we assume that W is symmetric; i.e. W — W^ 1 . 

Let Cw(G) (or just Cw or C if G or W are understood) denote the Cayley graph 
of Gw with all W-words in G as generators. We give C the structure of a (path) 
metric space, by declaring that each edge has length 1. This induces a metric on 
Gw which we call the W -metric, where the distance from g to h is the VF-length 
of g~ x h. 

Lemma 3.1. The W -metric is both left and right invariant for the action of Gw 
on itself. Furthermore, Aut(G) acts on Gw by isometries, and any homomorphism 
G — > H induces a 1-Lipschitz (simplicial) map from Cw{G) to Cw{H). 

Proof. For any f,g,h G Gw we have 

d(fg,fh)=l(g- 1 h\W) = d(g,h) 

and 

d(gf, hf) - KGr 1 /*)'" 1 \W) = Kg-'hlW) = d(g, h) 

The latter two properties are restatements of Lemma 12.141 in geometric language. 

□ 
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From the construction, the stable PF-length of an element g € Gw is just the 
translation length r(g) of g on Gw] i-e. the limit r(g) = linin-^oo d(id, g n )/n. 

The large-scale geometry of commutator subgroups (i.e. the case w = [x, y]) was 
studied in [8]. 

3.2. Asymptotic cone. Though Cw is an interesting geometric object in its own 
right, stable W-length can be more easily studied in an object derived from Cw, 
namely the asymptotic cone; see e.g. Gromov [13] Chapter 2 (especially p. 36) for 
an introduction to asymptotic cones in group theory and their properties. The con- 
struction of an asymptotic cone depends (in general) on a highly non-constructive 
choice, namely the choice of a non-principal ultrafilter. However, we will shortly re- 
strict attention to a subset of the asymptotic cone whose geometry does not depend 
on any choices, and therefore our discussion of such objects is very terse. 

Essentially, a non-principal ultrafilter w picks out a limit of every absolutely 
bounded sequence a n , which we denote lim w a n . We can think of lim w as a function 
from £°° to R; this function is a ring homomorphism. The "limit" lim^ is always 
contained between limsup and liminf, and is equal to the honest limit of some 
infinite subsequence of the a n (so, for instance, if lim„_ i . 00 a n exists it is equal to 
lim w a n ) but otherwise satisfies no a priori constraint. For more details, see Gromov 
op. cit. 

Fix (once and for all) a non-principal ultrafilter uj. 

Definition 3.2. Given a subset W C F and a group G, define the asymptotic cone 
Aw(G) (or just Aw or A if G is understood) to be the ultralimit of the sequence 
of metric spaces Cw{G) with the W- metric rescaled by a factor of 1/n. 

Thus a point in Aw(G) is an equivalence class of sequence {a n } with <i(id, a n ) — 
0(n), where d({a n }, {b n }) = lim^ d(a n , b n )/n, and where {a n } and {&„} are in the 
same equivalence class (denoted {a n } ~ {b n }) iff lim^ d(a n , b n )/n = 0. 

Proposition 3.3. The asymptotic cone Aw has the structure of a group, with a 
bi-invariant metric, satisfying the law W . 

Proof. The group structure on A is defined by taking {id} (the constant equivalence 
class) as the identity, defining multiplication by {a n } ■ {b n } — {a n b n }, and inverses 
by {a n } 1 = {a" 1 }. To see that this is well-defined, suppose {a n } ~ {a' n } and 
{b n } ~ {b' n }- By definition, we have a n = w n a' n and b n — v n b' n where ^u^lM^) = 
d(a n ,a' n ) and l(v„|W) = d(b n ,b' n ). Hence 

a n b n = w n a' n v n b' n = w n Vn"a' n b' n 

so 

lim d(a n b n ,a' n b' n )/n < lim(d(a n , a') +d(b n ,b'))/n = 

The bi-invariance of the metric on A follows immediately from the definition of the 
multiplication, and the bi-invariance of the metric on C. 

Finally, if {ai jn }{a2, n } ■ ■ ■ {a m ,n} is a W-word in A (with its group structure), 
then ai jn a2, n • ■ ■ o-m,n is a W-wotd in G for each n, and therefore 

{ai,n}{02,7i} ' • ' { fl m,n} = {^l.-n^.n " ' " CL m ,n} ~ {id} 

□ 
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3.3. The group Bw- We now relate Aw to Gw (in fact, to a slightly larger group 
depending on G and W). 

Lemma 3.4. Let W C F be given, and let g\,g2, • ■ • ,g-m be in Gw- There is an 
estimate 

m i ^'-'^r 1 ^^---^)\W)< 1 £l{g^- at] \W) 

Proof. By expanding, 

T Ttc; — n -a m -a, m -i n -a,2 bi-ai b 2 , . . n b m 

Liit> - g m g m _ l g 2 9i 92 9 m 

= g^-g'-T 1 ■ ■■9 b 2 2 - a2 ■ ■■gt(9i 1 - ai T 

= I[(9 b r a r 

i 

□ 



Remark 3.5. It would be nice to generalize Lemma 13.41 to certain homogeneous 
expressions 919% " • 9m whose product is in Gw, even if the individual gi are not. 
In fact, this can be done if m = 2, but not for higher m unless Gw — [G, G] (i.e. 
unless every commutator is a product of W- words). 
Suppose gh G Gw- Then 

g"h n = (ghr\ g n - 1 ,h]*\ g n - 2 ,hr---\g,h]* 
= (ghT{g n -\gh]*---ig,gh}* 

where we use the identity [a, 6] = [a, ab]*. Since any commutator [*, gh] is a product 
of at most 2 • \[gh\W) W-words, we see that \(g n h n \W) < 3n • \(gh\W). Moreover, 

(g™ 1 /!™ 1 )^ 1 ^" 2 ^™ 2 ) = {g n ^ ni h n2 - ni )* 
and we derive the inequality 

\{{g ni h n *)- l {g n *h n *)\W) < 3\n 2 - m| • \{gh\W) 



which is analogous to the estimate in Lemma 13.41 

On the other hand, in general the condition that xyz is in Gw does not imply that 
x n y n z n is in Gw for all n (in fact, for any n other than 0, 1), unless Gw = [G, G]. 
For example, if y, z are arbitrary and x is chosen so that xyz € Gw, then 

x~ 1 y~ 1 z~ 1 = x- 1 z~ 1 y~ 1 [y,z] 

Now, x^ 1 z~ 1 y~ 1 is in Gw because xyz is, so the left hand side is in Gw iff [y, z] 
is. But y and z are arbitrary, so this holds for all y, z if and only if Gw = [G, G], 
as claimed. 

Let Bw be the free group obeying the law W generated by the elements of Gw 
as a set, subject to the relation of homogeneity g n — g* n where 7k- is the group 
operation in Bw- There is a natural inclusion Gw — > Bw of sets, taking each 
element of Gw to the corresponding generator. 

Lemma 3.6. There is a homomorphism p : Bw — > Aw defined by the formula 

P(gi * 92 * ■ ■ ■ * 9m) = {<?" 92 ■■■9m} 
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Proof. From the definition of the group operation in Aw, the right hand side is 
equal to the product {<?" }{92 } ' ' ' {9m,} > so the map is a homomorphism. To see 
that it is well-defined, observe that all relations (i.e. W^-words, and the homogeneity 
relations) are satisfied in Aw by Proposition 13.31 so the map is well-defined. □ 

Remark 3.7. It is not typically true that this homomorphism is an injection, even 
if G is free. 

This homomorphism lets us map Gw to Aw by g — > {g n }- From the definitions, 
sl(g|W) = <i({id}, {5™}); in other words stable w-length can be recovered from the 
geometry of Aw and the map Bw — ^ Aw - 

3.4. Real structure. For any group G, let Bw ® R be the free group obeying the 
law W generated by expressions of the form g l where g 6 Gw and t 6 R, subject 
to the relation of homogeneity g s * g l — g s+t . We topologize Bw ® R with the 
weakest topology for which multiplication and inverse are continuous, as well as 
each homomorphism R — > Bw €3 R of the form t —> g*. 
We extend p : Bw — > Aw to p : Bw <8> R by defining 

g L ^{g^} 

on the K subgroups, and extending to arbitrary products by using the group mul- 
tiplication. Hence 

9 t 1 1 9 t 2 2 ---9 t m^{ 9 [ tini 9i t2ni ---9it nl } 

Notice that although g L tn -I is not necessarily the inverse of g L~*™J if tn is not an 
integer, nevertheless these elements are distance at most 21(g|PF) apart in Cw, and 
therefore {g^- tn ^} and {c/L— * T1 J | are inverse in Aw- 

Definition 3.8. The real cone Aw is the image of Bw <8> K in Aw- 

For the remainder of the paper we restrict attention to Aw- The metric on 
Aw restricts to a (bi- invariant) metric on Aw, consequently giving it the structure 
of a topological group. In the sequel, we use the notation ||g|| for d(id,g), where 

g e A w . 

The real structure on Bw <8> R gives rise to a natural (multiplicative) R action 
on Aw- 

Lemma 3.9. There is a continuous family of endomorphisms R x Aw — > Aw with 
the following properties: 

(1) the action is multiplicative — i.e. lx is the identity on Aw, and A x (/i x 
g) = (A/i) x g for A, fj, € R and g 6 Aw; and 

(2) for any g £ Aw and A £ M* , we have ||A x g\\ = |A| • \\g\\ 

Proof. Let B be the group freely generated by expressions of the form g l with g G G 
and ( £ M, subject to the relations g t g 8 = g t+s . There is a natural R action on 
G, given by A x g l = g xt . This action evidently preserves the Vl^-subgroup of B, 
and therefore descends to an automorphism of Bw €3 R. We claim that this action 
preserves the kernel of p, and thereby defines an action on Aw- In other words, we 
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need to show that if \\pigl 1 ■ ■ ■ g 1 ^ 


)\\ =0 then \\p(g^ 


• • • q xt 


1,1 ) =0 for any A. But 




= d({id},{g[ xtinl - 


am 


>nj }) 




= lim- l( ff [ Atl " J •• 
w n 


l\t m 
j rn 






= A-limil( ffl Ltl " J 


■■■git 





ui n 

= ^-\\p(g t i 1 ---gt)\\ 

(where the third line follows from the second by an approximate change of variables 
and elementary estimates) thereby establishing both claims. □ 

The existence of this R action has the following topological consequence. 

Lemma 3.10. Aw is contractible and locally contractible. 

Proof. The R action defines a deformation retraction of Aw to the identity element, 
where each element g moves along the path (1 — t) X g where t goes from to 1. 
This retraction takes the ball of radius r around the identity inside itself for all 
positive t, so Aw is locally contractible. □ 

Since Aw obeys the law W, so does Aw- However, under certain circumstances 
we can say much more about Aw ■ The main theorem of this section is the following: 

Theorem 3.11. Suppose Aw is nilpotent. Then it is a normed vector space (in 
particular it is abelian). 

Proof. If Aw is abelian, then the existence of the R action with the properties 
proved in Lemma 13.91 shows that Aw is a normed vector space, with norm || • ||. So 
it suffices to show Aw is abelian. 

For legibility, denote the nth element of the lower central series of Aw by A n , 
so that Ai — Aw and A n — for some n. Suppose we have shown for some integer 
k, for all real t, for all h G Ak and all g s Gw, that the commutator [pig 1 ), h] = id. 
Since Aw is generated by elements of the form p{g l ) this implies that A^+i = 0. 

Let h e Ak-x- Then 



W)M = [ P {g t l 2 )M[p(g t/2 )A[[h,p{g t,2 %p{g t/2 ) h ] by Lemma[ 

= [pig 1 ' 2 ), /i][p(.g t/2 ), h] because A k+1 = 

= [pig^^^pig^lh]- 1 by LemmaES 

= [pig t/2 ),h 2 ] because A k+1 = 

By induction, [pig l ),h] = [pig 1 ^ 2 ),h 2 ] for every positive integer m. Hence 

\\[p(g%h]\\ = [[[pig^lh 2 ^ = y/* m ■ ig-^TW < 2 • 2-">|| fl *[| 

Since m is arbitrary, [pig*), h] = id and therefore (assuming k > 2), A k = 0. Since 
Aw is nilpotent by hypothesis, A n = for some n, and therefore Aw is abelian, as 
claimed. □ 

As a corollary, one obtains the following propositions. 

Proposition 3.12. Suppose Aw is locally compact. Then it is a (finite dimen- 
sional) normed vector space. 
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Proof. Since Aw is connected, locally path connected (by Lemma f3. 10[) and locally 
compact by hypothesis, the Gleason-Montgomery-Zippin theorem (i.e. the affirma- 
tive solution of Hilbert's fifth problem [TTJ [TB]) shows that Aw is a Lie group. 
Since it obeys a law, it is necessarily solvable, since non-solvable Lie groups contain 
nonabelian free subgroups. 

By Theorem 13.111 it suffices to show it is nilpotent. Equivalently, we need to 
show that for each g G Gw an d t£K the conjugation action of p(g t ) on the derived 
subgroup A' is trivial. 

Suppose not, so that p{g l ) is a 1-parameter family of isometric automorphisms 
of A'. Since A' is finite-dimensional, the group of isometric automorphisms is 
compact. In particular, there are arbitrarily large values of t for which pig 1 ) is 
arbitrarily close to the identity in the isometry group of A 1 . 

By the compactness of the group of isometric automorphisms of A' there is a 
constant C so that if l is an automorphism with d(h, t{h)) < e for all h in the ball 
about the identity in A' of radius 1/2, then d(h, i(h)) < C ■ e for all h in the ball 
about the identity in A' of radius 1 (in fact, since the metric on A' is bi-invariant, 
we can take C = 2, but this is unnecessary for our argument). 

Now, for any positive e, there are arbitrarily large t such that d(h, h p ( g ') < e for 
all h in the ball of radius 1 in A' . Consequently, for all s G [1/2, 1] and all h in the 
ball of radius 1 in A' , 

d(s xh,sx (h^*))) = d{s xh,(sx /i) p(9 * S) ) < se 

It follows that d(h, h 9 ^^) < seC for all h in the ball of radius 1 in A', and all 

s G [1/2, 1]. It follows that d(h, hP^')) < 2eC for all t' G [0, 4/2]. 

Since e is arbitrarily small, since C is fixed, and since t can be chosen arbitrarily 
large for each e, it follows that the conjugation action of p(g l ) is trivial for all 
g G Gw and t G R. Hence A' is central, and therefore Aw is nilpotent, and 
therefore abelian and a finite dimensional normed vector space, as claimed. □ 

Remark 3.13. In fact, we do not really need the full power of Gleason-Montgomery- 
Zippin. We only need to know that the group of isometries of A' is compact (which 
follows from Arzela-Ascoli and local compactness) to deduce that A is abelian; 
then one may appeal to Pontriagin's solution of Hilbert's fifth problem for locally 
compact abelian groups. 

Recall the notation '■= [[x, y], [z, w]] for free generators x, y, z, w. 

Theorem 3.14. Suppose in some group G that stable commutator length vanishes 
identically. Then Aw is abelian ( and hence a normed vector space ) for W = fii . 

Proof. By Theorem 13.111 it suffices to show that Aw is nilpotent. We show it 
satisfies the law [x, [y, z]] = 0. 

Let /, g, h be arbitrary elements of Aw, and suppose / = p(/ 1 1 f% 2 ■ ■ ■ f^)- We 
write g = {g n }, h = {h n } and / = {/}" tlJ / 2 L ™* 2j •••/m* mJ }. By hypothesis, the 
commutator length of each expression f^ nt ^ is o(n). It follows that 

[/, [5, h}] = {[/} ntlJ ft t2i ■ ■ ■ ft tmi , [9n, Ml = Mn) (3 2 - words} = 



where we use Lemma [231 bullet (4) in the second step. Hence Aw is nilpotent and 
therefore abelian. □ 
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Remark 3.15. Recall that f3\ — [x, y], so that /3i -length is just commutator length. 
A similar argument shows that if stable /3„-length vanishes identically in G, then 
for W = Pn+i the group Aw obeys the law [x, f) n ]. 

3.5. 7„-quasimorphisms. For the remainder of this section we typically specialize 
to the case that w = 7„ (recall 72 = [x,y], 73 = [x, [y,z]] and so on). Inspired by 
the phenomenon of (generalized) Bavard duality in the theory of stable commutator 
length, we make the following definition: 

Definition 3.16. A homogeneous W -quasimorphism (hereafter W-hoq) is a Lips- 
chitz homomorphism from Aw to R. 

If cj> is a VF-hoq, let d(<f>) denote the optimal Lipschitz constant. 

For general W, it is not clear whether any homomorphisms from Aw to K exist, 
let alone Lipschitz ones. But when Aw is a normed vector space (which holds, for 
instance, if W = "f n , by Theorem 13. 11|) the Hahn-Banach theorem guarantees the 
existence of a rich supply of Vl^-hoqs. 

In fact, we have the following proposition: 

Lemma 3.17. For any n and any g € G n , there is an equality 

s%|7n) = snp^(p{g))/d{^>) 

where the supremum is taken over all "f n -hoqs cj>. 

Proof. This follows from the fact that ||p(g)|| = s\(g\j n ), together with Theo- 
rem [3T1] and the Hahn-Banach theorem. □ 

Whenever Aw is a vector space, it is spanned by the vectors p(g) for g € Gw- 
Since a W-hoq cj> is a homomorphism, such a function on Aw is determined by its 
values on p(g). Therefore, by abuse of notation, we think of a VF-hoq in this case 
as a function on Gw] thus the equality from proposition 13.171 would be expressed 
in the form sl(<7|7„) = sup^ cp(g)/d(cp). On the other hand, it is not evident from 
the definition how to recover (or to estimate) d(cfr) directly from the values of cp on 
the elements of Gw- We now address this issue. 

Definition 3.18. A weak 7„-hoq is a function cj> : G — > R for which there is a 
least non-negative real number D(cj>) (called the defect) satisfying the following 
properties: 

(1) (homogeneity) for any g G G and any n € Z there is an equality 4>(g n ) = 
ncp(g); 

(2) (quasimorphism) for any g,h £ G there is an inequality 

\4>(g) +4>{h) - cf)(gh)\ < D(0)mm(l( 5 |7„-i),l(/i|7n-i),l(^l7n-i)) 

Remark 3.19. For n = 2, this is precisely the classical definition of a homogeneous 
quasimorphism. For, 71 = x and therefore l(g|7i) = 1 if g 7^ id and otherwise. 

From the definition one can deduce some basic properties of weak 7„-hoqs. 

Lemma 3.20. Suppose G is perfect. Then any weak "/ n -hoq cj> : G — > K satisfies 
the following properties: 

(1) cj> is a class function; 

(2) if h is a j n -word, then cj>(h) < D{cj>); 
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(3) for any g £ G there is an estimate 

<P{g) < (2l(g\ ln ) - l)D{<j>) 

and consequently 

sl( 5 |7n) > <t>{9)l1D{4>) 

Proof. The hypothesis that G is perfect implies (inductively) that l(<jf|7 TO ) is finite 
for every g G G and every integer m. To see that 4> is a class function, observe that 
for any elements g,h G G and any n one has 

Mhgnh- 1 ) - <t>{h) - 0(.g") - 0(01 < 2U(#( ff |7 n _ 1 ) 

By homogeneity, the left hand side is equal to n ■ ^(hghT 1 ) — <fi(g)\ whereas the 
right hand side is a constant independent of n, thus proving the first claim. 

Now suppose h is a 7„-word; i.e. h — [x,y] for some x and y where y is a 
7n- i-word. Then 

\cj>{h) - ^(xyx' 1 ) - 4>{y~ l )\ < D(0(y\ 7n ^) = D(0) 

On the other hand, since is a class function and homogeneous, the left hand side 
is equal to \4>(h)\ and the inequality is proved. 

Finally, if we express g = hih n ---h m where each hi is a 7„-word, then by 
induction 0(g) < J2 4>{K) + {m-l)D((t>) < (2m-l)D(<j>). This proves bullet (3). □ 

Proposition 3.21. The set of weak j n -hoqs on G is a real vector space, which we 
denote n Q(G), and D(-) is a semi-norm. If G is perfect, n Q(G) is a Banach space 
with the norm -D(-). 

Proof. The defining properties of a weak 7„-hoq are an (infinite) system of linear 
equalities and inequalities. It follows by inspection that n Q{G) is a real vector 
space, and that D(-) is a semi-norm. 

If G is perfect, then \(g\j n _i) is finite for any g, and therefore D(<f>) = if and 
only if (j) is a homomorphism to K, which is necessarily trivial for G perfect. Thus 
-D(-) is a norm, and it remains to show that n Q(G) is complete in this norm. 

So, let <j>i be a sequence of weak 7 n -hoqs for which D(cj>i — <j)j) is a Cauchy 
sequence; i.e. for all e > there is an N so that £)(</> j — <j>j) < e for i, j > N. By 
bullet (3) from Lemma 13.201 it follows that the values of the on any element g 
form a Cauchy sequence, and therefore the <fn converge pointwise to a limit 4>. A 
pointwise limit of homogeneous functions is homogeneous; moreover, for any g, h 
pointwise convergence implies 

1(0 - 0j)(fl + h-gh)\ < limsupD(0i - 3 -)min(l(5r|7 n -i),l(^|7n-i),l(5^l7n-i)) 

i 

and therefore <f> is in n Q(G) and D{4>i ~ <t>) ~* 0, and the lemma is thereby proved. 

□ 

The following lemma justifies the terminology "weak 7„-hoq" . 
Lemma 3.22. If G is perfect, every j n -hoq is a weak "f n -hoq with D((f>) < d{4>). 
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Proof. Let <fi : A ln — > R be a d(0)-Lipschitz homomorphism, and by abuse of 
notation we let <fi : G — » R be the function denned by </>(<?) :— <fi(p(g)). We estimate 

\4>{g) + <f>(h) ~ 4>{gh)\ = \4>( P {g * h * (h-'g-'M 

<d(<t>)\\{g m h m h- 1 g- 1 ---h- 1 g- 1 }\\ 

= d((f>) lim -\{g m h m h- l g- x ---h- 1 g- 1 \ ln ) 

m^-oo rn 

<d(cp) lim -i([ 5 ,/i]*[5 2 ,/ir---[p m - 1 ,/iri7n) 

< d(4>)l(h\ Jn ^) 

By symmetry, we obtain the estimate 

\<t>(g) + <(>(h) - c^{gh)\ < d{4>) wa.{\{jsYln-i\ l(%n-i), %%»-i)) 
so that </> is a weak 7„-hoq with D((f>) < g?(0), as claimed. □ 

We therefore obtain the following generalization of Bavard duality. 

Theorem 3.23 (7„-Duality theorem). For any perfect group G, and any g E G 

there is an inequality 

su P 0( 5 )/£(0) > sl(s|7 B ) > su P 0(.g)/2^(0) 
where the supremum is taken over all weak "f n -hoqs. 

Proof. The upper bound follows from Lemma 13.221 and Lemma 13.171 and the lower 
bound from Lemma 13.201 □ 

An interesting application of this duality theorem is an a priori estimate of the 
ratio of scl(*) and sl(*|7„) in any perfect group. This estimate follows from the 
surprising fact that one may bound from below the stable 7„_i length of a word 
from the value of a weak 7 n -hoq, providing n > 3. 

Lemma 3.24. Let G be a perfect group. Then any weak "f n -hoq (j) ; G — > R satisfies 
the following properties: 

(1) if h is a ^n-i-word, then 4>(h) < D{<fi) 

(2) for any g G G there is an estimate 

ct>{g) < (21( 5 | 7 „-i) - l)D[ct>) 

and consequently 

sl(ff|7n-i) > 0Cff)/2I3(0 

Proof. Let g be arbitrary, and let k be a 7„_2-word. There is a constant C so that 
for any m we have \(g m k m (gk)~ m \j n ) < Cm and therefore by Lemma [3.201 we can 
estimate 

\^9 m k m (gk)- m )\ < Cm 
for some constant C' depending only on g, k and D(cf>). 
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On the other hand, we can write 

g m k m (gk)- rn = g m k m k~ 1 g~ 1 k' 1 g- 1 ■ ■ ■ k' l g~ l 
= g m k m - 1 g- 1 k- 1 ■ ■ ■ k- x g- x 
= g m k m - 2 g- 1 [g,k]g- 1 k- 1 ---k- 1 g- 1 
= g m k m - 2 g- 1 g- 1 k- 1 ---k- 1 g- 1 [g,k}* 
= 5 m fc™-V W'fc- 1 • • • fc-W k]*[g, k]*[g, k}* 
= product of m(m — l)/2 conjugates of [g, k) 

By abuse of notation, for any r we denote a product of r conjugates of [g, h] by 
[g,h]* r . Hence we have g m k m {gk)- m = [g, fc]*"»("»-i)/2 < n the other hand, for 
any r, 

Mg,kD - ^[g t k]* r - 1 ) -4>{[g,k])\ < D{ft 

by the defining property of an 7„-hoq, since is a class function, and \{k\^f n -2) = 1 
implies \{[g, k]\^f n -i) < 1. By induction and the triangle inequality we obtain 

\<f>([g,k}* r )-r<f>(lg,k})\ <rD(4>) 

Hence we conclude 

\<t>{g m k m {gk)- m ) - (m{m-l)/2)4>{[g,k})\ < (m(m - 1)/2)D{4>) 

Since \4>(g m k m (gk)~ m ' < Cm, dividing by m(m — l)/2 and taking m — > oo gives 
the estimate 

Mg,k})\ <D(q>) 

Since g was arbitrary, and k is an arbitrary 7„_2-word, we have proved bullet (1). 
Bullet (2) follows exactly as in Lemma [3.201 if g = h\h2 ■ ■ ■ h m where each hi is a 
7„_i-word, then by induction 

m < 4>{fH) + (m - l)D{ct>) < (2m - l)D(</>) 

This completes the proof of the lemma. □ 

From this easily follows our Comparison Theorem: 

Theorem 3.25 (Comparison theorem). For any perfect group G, for any n > 2 

and for any g G G there is an inequality 

2 n - 2 sc\(g) > sl( 5 | 7 „) > scl( ff ) 
Proof. From Lemma 13.241 and Theorem 13.231 we estimate 

2sl(<7| 7 „-i) > sup <l>{g)/D{<t>) > sl(g\ ln ) 

where the supremum is taken over all weak 7 n -hoqs <f>- By induction on n, we get 

2"- 2 scl( 5 ) > sl( ff | 7 „) 
The second inequality in the theorem is trivial, since every 7 n -word is a 72-word. □ 
It follows that weak 7„-hoqs can be used to estimate scl: 

Corollary 3.26. Let G be a perfect group, and <f> a weak ^n-hoq. Then for any 
g G G we have 

sc%) > ^G?)^"- 1 ^) 
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Stable commutator length has many important applications to geometry, topol- 
ogy, dynamics, etc. In general, estimating scl is very difficult; lower bounds are 
usually obtained from (ordinary homogeneous) quasimorphisms (equivalently, weak 
7 n -hoqs when n = 2), and by now many interesting constructions of such quasi- 
morphisms are known; see e.g. [6], especially Chapters 3 and 5. In view of Corol- 
lary [37261 it becomes interesting to ask whether there are any natural constructions 
of weak 7 n -hoqs with n > 2, arising from geometry or dynamics. 

The statement of the Comparison Theorem is purely algebraic, and it is therefore 
natural to try to find a purely algebraic proof, bypassing the construction of as- 
ymptotic cones, the use of Hahn-Banach, etc. In fact, it is not too hard to translate 
the geometric argument into an algebraic one, though the geometric argument has 
its own charm. The case of 73 is especially straightforward; with the same amount 
of work, one proves a slightly stronger statement. 

First, in any group G, let denote the set of words of the form [x, y] where 
y € [G, G]. The I"3 words generate the subgroup G3 (as do the 73 words, which 
are special examples of 1^ words), but there is no uniform comparison between 73 
length and 1^3 length. If G is perfect, then G = [G, G] = G2, so for such groups, 1^3 
words are nothing other than commutators, and 1(*|T3) = cl(*). 

Proposition 3.27. In any group G, for any g £ G3 there is an inequality 

2 S \(g\T 3 ) > s\(g\ l3 ) > s\(g\T 3 ) 

Proof. Recall (from Notation I2.12[) the notation 73 for an arbitrary product of 
73-words. 

Express g k as a product of commutators 

g k = [01 , 61] [02, 62] • • • [a m , b m ) 

where the bi are all in [G, G], and where m/k is as close to sl(<?|r 3 ) as we like. Then 
we have 

g 2 k = ([01,61] • • • [a m ,b m ]) 2 =[ai,&i] 2 ■ • • [a m , b m ] 2 7 3 [m2 } 
Now, for each i we have [a^, bi] 2 — [a 2 , 6^73^ and therefore 

[a^] 2 " = [alb^ll 2 "" } = [«f Ab 3 {2 "- 1} 
and therefore we can write 

2"fc r 2™ 7 ir 2" 7 1 r 2" 7 1 {m2"-m} 

g = [a x , bi] [a 2 , 6 2 ] • • • [a m , b m }^ 

We can estimate l([a 2 ™ , 61] [a 2 " , 62] • • • [a 2 ^ , b m ] I73) < J2i b 2 )' wmcn is a constant 
independent of n. Since m/k is as close as we like to scl(g), and n is arbitrary, we 
deduce sl(p|73) < 2scl(g) as claimed. □ 

We do not know whether there is an a priori comparison between sl(* I73) and 
scl(*) on elements of G3 for an arbitrary group. 

3.6. /?2-quasimorphisms. By Theorem 13.141 if G is a perfect group in which scl 
vanishes identically, then Ap 2 is a vector space, and s\(g\(3 2 ) = sup^ (p(g) / d(<f>) 
where the supremum is taken over all /?2-hoqs <f>. We would like to obtain a "weak" 
characterization of /32-hoqs on such groups, analogous to the definition of weak 
7„-hoqs. 
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Definition 3.28. A weak f3 2 -hoq is a function </> : G — > R for which there is a 
least non-negative real number D(<p) (called the defect) satisfying the following 
properties: 

(1) (homogeneity) for any g 6 G and any n G Z there is an equality 4>(g n ) — 
n<f>(g) 

(2) (quasimorphism) for any <?, /i e G there is an inequality 

|0(<?) + <XM - <t>(9h)\ < D{<t>) rmn(\(g\i3 2 ),\(h\i3 2 ), \(gh\[3 2 )) 

The following is the analogue of Lemma 13.221 

Lemma 3.29. If G is perfect and scl vanishes identically, every f3 2 -hoq is a weak 
(3 2 -hoq with D(<j>) < 2d(<f>). 

Proof. As in Lemma [3.221 we must estimate the /3 2 -length of g m h m (gh)~ m . As 
before, we have an identity 

g m h m (gh)- m = [g, h]*[g\ h}* • • • [g m ~\ h}* 

We can write h — [ai,&i] ••• [a r , b r ] where each ai,bi is a commutator, and r = 
l(h\/3 2 ). Then 

WM = \j, [ax, M V, [aa, 6a]]* ' ■ ■ W, K, W]]* 
By the Hall- Witt identity (i.e. bullet (5) from Lemma r2.2[) we can write 

[g j ^iM]} = [ah[bl(g j r}m,[<,(g j n 

and since a,, bi are both commutators by hypothesis, the right hand side is a product 
of two /?2-words. Hence 

l(g m h m (gh)- m \p 2 ) < 2ml(h\p 2 ) 
By symmetry, the lemma follows. □ 

The following is the analogue of Lemma l3.201 though the proof is more circuitous, 
and has something in common with that of Lemma 13.241 

Lemma 3.30. Suppose G is perfect, and scl vanishes identically in G. Then any 
weak fj 2 -hoq <j> : G — > K satisfies the following properties: 

(1) <j) is a class function; 

(2) if h is a fj 2 -word, then <p(h) < D((fi); 

(3) for any g € G there is an estimate 

<j>(g) < (2\(g\(3 2 ) - 1)D{4>) 

and consequently 

sl( ff |/3 2 ) > ^g)/2D{4>) 

Proof. To see that <f> is a class function, observe as in the proof of Lemma f3.20l that 
for any g, h and any n, 

IHhg^- 1 ) - 4(h) - 0( 5 ") - ^h- 1 )] < 2D(<f>)\(h\(3 2 ) 

By homogeneity, and the fact that the right hand side is constant independent of 
n, we see that (^(hgh -1 ) — <p(g); i.e. is a class function. 

Secondly, observe that ifl(g\f3 2 ) — 1 then <p([g,h]) < D(<p). This is because 

\4>(ghg~ 1 hr 1 ) - 4(g) - 4(hg- x h~ x )\ < D(<f>)\(g\(3 2 ) = D(<f) 
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and because </> is a homogeneous class function. If we let a denote the word 
[[it, v], [[x,y], [z,w]]j then a is both a /32-word, and the commutator of something 
with a (32 word. Thus by induction, \4>{g)\ < 2D(<fi)\(g\a). Since G is perfect, 
every element has a finite a-length, so for any g and h and any integer m we have 
\cj)(g m h m (gh)~~ m )\ < Cm for some constant C depending only on g, h and D((f>). 

On the other hand, as in the proof of Lemma T3.241 we can write g m h m (gh)~ m 
a a product of m{m — l)/2 conjugates of [g, h]. If g and h are both commutators, 
[g, h] is a /32-word, and therefore 

\<t>{g m h m {gh)- m ) - (m(m - l)/2)0([.g, < (m(m - l)/2)D(</>) 

and consequently |0([g, h])\ < D((p). Since g and h are arbitrary commutators, this 
shows that \<j)(g)\ < D{4>) f° r an Y Arword 5, proving the second claim. 

The third claim follows immediately from this, as in the proof of Lemma l3.201 □ 

Theorem 3.31 (/?2-Duality theorem). For any perfect group G in which scl van- 
ishes identically, and for any g £ G there is an inequality 

mp2<f>(g)/D(<fi) > sl( 5 |ft>) > su P 0( 3 )/2^(0) 

<i> 4> 

where the supremum is taken over all weak fa-hoqs. 

Proof. The upper bound follows from Lemma I3.29| and the lower bound from 
Lemma 13.301 □ 

We conclude this section by making a curious observation on the relation between 
sl(*|/32) and 1(*|73), under the hypothesis that scl vanishes identically. 

Proposition 3.32. Suppose G is perfect, and scl vanishes identically. If\{g\^) = 1 
then s\{g\p 2 ) < 1. 

Proof. Since for any a, b, c we have [a,6][a,c] = [a, 6c][[a, c], 6]* by bullet (3) of 
Lemma l2~2l it follows that 

[x, [y, z]][x, [y, z] n ] = [x, [y, z} n+1 ][[x, [y, z}% [y, z}}* 

for any n and any x, y, z. Since [[x, [y, z] n ], [y, z]] is a /32-word, it follows by induction 
that [x, [y, z]] n can be written as a product of (n — 1) /3 2 -words with [x, [y, z] n ]. If 
scl vanishes identically, [y, z] n can be written as a product of o(n) commutators. If 
G is perfect, x is a product of a finite number of commutators. Hence the ,82-length 
of [x, [y,z] n ] is o(n), and therefore sl([x, [y, 2;]] < 1 as claimed. □ 

It seems hard to generalize Proposition ^. 321 to estimate sl(*|/?2) from 1 ( * | T3 ) - 

3.7. Perfectness and virtual perfectness. Throughout this section we have 
usually made the assumption that G = Gw When W — j n for some n, this is 
equivalent to the statement that G is perfect. However it is evident that in most 
arguments it is sufficient to replace any given element g with a (fixed) power g m . 
In particular, the theorems in this section remain true under the weaker hypothesis 
that for every g £ G there is a positive integer m such that g m € Gw ', equivalcntly, 
the quotient G/Gw is torsion. For general W this is implied by, but weaker than, 
the condition that Gw has finite index in G, even if G is finitely generated. But 
for W — 7n, the two conditions are equivalent when G is finitely generated. 

Lemma 3.33. For any n, the quotient G/G n is torsion if and only if G/G2 is 
torsion. 
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Proof. One direction is obvious. We prove the other direction. Suppose G/G2 is 
torsion, and suppose by induction we have shown G/Gfc is torsion for some k > 2. 
Let g € Gfc, and write g = [01, &i][<Z2, 62] ■ ■ • [ir^r] where each 6 r G Gfe-i. Let m 
be such that 6™ G G fc for all i. Then 

g m = [a 1 ,b 1 ] m ---[a r ,b r } m mod G fc+ i 
= [a 1 ,6f]---[a r ,6^] mod G k+1 
= mod Gfc+i 

By induction, the lemma is proved. □ 

Thus, Theorem 13.251 remains true with "perfect" replaced by "virtually perfect" 
(or even by: "group whose abelianization is torsion" ) . 

Given a group G, one may attempt to obtain a lower bound on sl(*|'y n ) on 
elements of G„ by embedding G in a perfect group H, and using Theorem 13.231 
applied to H together with monotonicity of sl(*|*) under homomorphisms. It is 
sometimes easier to take H to be virtually perfect rather than perfect; we shall see 
an example in § 2] 

4. Gropes 

The purpose of this section is to show how hyperbolic geometry can be used to 
give straightforward lower bounds on 1 ( * | ^3 ) - We give some examples that show that 
the uniform comparisons in Theorem 13.251 and Proposition 13.271 for stable lengths 
do not have any analogue for unstable length. 

The arguments depend on the geometry of certain objects called gropes. We do 
not discuss here the most general kind of gropes, but only the simplest nontrivial 
examples. For a general introduction to gropes, see § 13 of [9]. Informally speak- 
ing, gropes topologize the commutator calculus, and questions about expressing 
elements in groups as products of j n (or j3 n ) words can be translated into questions 
about the existences of maps of certain kinds of gropes to spaces. We also use some 
elementary facts from the theories of CAT(— 1) complexes and pleated surfaces. A 
basic reference for the first is [2], especially pp. 347-362. A basic reference for the 
second in [35], Chapter 8. 

Let S n be an oriented surface of genus n with one boundary component. Let S n ^i 
be obtained by attaching a once-punctured torus to each of a maximal collection 
of pairwise disjoint homologically essential loops pi in S n (there are n such). We 
denote by dS n ,i the boundary of S n (contained in S n< i). 

Now, let G be a group, and let A be a space with 7Ti(X) = G. If g £ G is given, 
let 7 : S 1 — > A be a loop whose free homotopy class corresponds to the conjugacy 
class of g. Observe that from the definitions, 1(5(73) < n if and only if there is a 
map / : <SVi,i — > A such that df : dS nj i — > X factors through a homeomorphism 
h : <9SVi,i —> S 1 in such a way that 7 o h — df. Informally, the 73-length of g is at 
most n if and only if there is a map from S n ,i to X whose boundary wraps around 
7- 

Proposition 4.1. For each n, let F be free on the generators £1,3/1, • ■ • , x n ,y n , z 
and let w n = [z, [xi,yi] [£2,2/2] • ■ ■ [x ni y n ]}. Then l(to n |73) > 2n/3. 

Proof. We build a K(F, 1) (called X) as follows. Start with a hyperbolic once- 
punctured torus S with totally geodesic boundary. Let a be an embedded geodesic 
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in S representing the meridian. Take another hyperbolic surface S' of genus n with 
totally geodesic boundary of length length(a), and attach dS' isometrically to a. 
The resulting space X is a K(F, 1), and the conjugacy class of w n is represented 
by the boundary circle dS, which by abuse of notation, we denote dX . 

Since X is obtained by gluing convex hyperbolic 2-complexes along convex sub- 
sets, it is itself a CAT(— 1) 2-complex. If l(w n \j^) = m, there is a map / : S m .i — > X 
sending dS m .\ to dX. We homotop this map to a pleated representative in a special 
way. First we choose an ideal triangulation of S m for which the geodesic represen- 
tatives of the Pi (with notation as above) are contained in the pleating locus A. 
The map / can be homotoped on S m to take each leaf of A to a geodesic in X, and 
to be 1-Lipschitz on each ideal triangle of S m — A for some hyperbolic metric on 
S m . Then / can be homotoped rel. Uij3i to a pleated representative with respect 
to some hyperbolic metric on each once-punctured torus component of S m ,i — S m . 

The key property of a pleated map is that it is area non-increasing. Moreover, it 
is surjective onto X, since for any point p € X, the conjugacy class of w n is not in 
[iri(X —p), [tti{X —p),ni(X —p)]]. By Gauss-Bonnet, area(5 m ,i) = 2n ■ (3m — 1) 
and area(X) = 2ir-2n. Hence 3m — 1 > 2n and therefore m > 2n/3, as claimed. □ 

On the other hand, sl(w n |73) < 1 for all n, by Proposition 13.271 This example 
shows that the comparison theorem (Theorem 13 . 25[) has no analogue for unstable 
7 n -lengths, even if n = 3. 
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